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A.  ABSTRACT 

A  multiplexed  screening  system  to  detect 
pathogenicity  islands  (PI)  of  bacteria  causing  enteric 
disease  and  pathogenicity  factors  (PF)  associated  with  the 
SARS-associated  coronavims  (SARS-CoV)  and  Rift 
Valley  Fever  Virus  (RVFV)  has  been  developed.  This 
system  is  based  upon  the  Luminex  xMAP™  System,  a 
multiplexed  assay  platform  that  combines  high  sample 
throughput  (up  to  600  samples/hour)  with  high 
information  content  per  sample  (up  to  100  parameters 
tested  simultaneously  per  sample).  Pathogenic  bacteria 
contain  DNA  sequences  (pathogenicity  islands)  that  code 
for  proteins  enhancing  the  ability  of  the  bacteria  to  cause 
disease  in  the  host.  These  virulence  determinants  may 
enhance  binding  of  the  bacterium  to  host  tissues,  increase 
the  uptake  of  necessary  nutrients  from  the  environment 
into  the  bacterium,  or  function  as  toxins  that  cause  the 
survival  of  the  host  cell  to  become  compromised. 
Whereas  viruses  do  not  possess  pathogenicity  islands, 
they  do  possess  genomic  sequences  (PF)  that  are 
associated  with  their  pathogenicity.  By  quickly  screening 
for  multiple  pathogenicity  island  and  factor  sequences, 
end-users  will  have  the  capability  to  detect  the  first  signs 
of  an  enteric  or  viral  bioattack  without  requiring  screening 
for  a  particular  organism.  This  detection  system  will  have 
hundreds  of  probes  (multiplexed),  so  false  positives  and 
false  negatives  can  be  reduced  and  managed. 

Proof  of  principle  for  this  system  was  established 
generating  probes  (>20  nucleotides  long)  designed  to 
react  with  the  following  PI:  Hut  A,  eae,  ipaA,  stx,  ctx, 
shuA,  jet ,  and  iuc.  Probes  were  also  prepared  to  react  to 
the  M  and  N  genes  of  the  SARS-CoV  and  the  NSs  and  L 
genes  of  RVFV.  A  multiplexed  system  to  screen  for 
multiple  Pis  and  for  viral  pathogenicity  factors  was 


developed.  Two  sequences  for  each  of  the  SARS-CoV  (M 
and  N)  and  the  RVFV  (NSs  and  L)  were  prepared.  Data 
demonstrate  a  strong  signal  to  noise  ratio.  The  proposed 
screening  system  has  commercial  applications  in  office 
health  monitoring,  hospital  infectious  disease  detection, 
environmental,  agricultural  and  food-borne  pathogen 
monitoring.  As  new  pathogenicity  islands  and  factors  are 
discovered,  or  mutations  are  determined,  new 
microsphere  probe  sets  that  carry  new  sequences  can  be 
included  to  seamlessly  update  any  developed  kit. 

B.  INTRODUCTION 

The  presence  of  pathogens  in  the  environment  is 
usually  recognized  only  after  illnesses  have  resulted  from 
infection  by  the  pathogen  or  its  toxic  products.  The  time 
interval  between  exposure  of  a  target  to  bacterial  or  viral 
threat  agent  and  the  development  of  clinical  signs 
approximates  between  36  hours  to  ten  days.  During  this 
interval,  the  exposed  subject  may  disseminate  disease  to  a 
second  generation  of  patients  and  exacerbate  the 
management  problem;  however,  antibiotics  and  antivirals 
can  also  be  administered  to  ameliorate  the  development  of 
disease.  Toxins  can  cause  clinical  signs  to  appear  within 
minutes  to  days  but  the  dissemination  of  disease  to  a  new 
population  is  very  unlikely.  The  impact  of  these  infectious 
agents  on  human  and  livestock  targets  would  be  greatly 
reduced  if  the  threat  agent  is  detected  prior  to  or  soon 
after  exposure. 

Pathogenic  bacteria,  viruses  and  fungi  can  be 
detected  by  a  variety  of  antibody,  metabolic,  or  DNA 
based  assays;  toxins  are  most  readily  detected  by  antibody 
based  sensors  or  devices  that  measure  the  effect  of  the 
toxin  on  metabolism  of  cells.  If  performed  appropriately, 
genomic  based  detection  systems  readily  detect  known 
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threat  agents  with  a  high  degree  of  certainty  (very  low  to 
no  false  positives  or  negatives).  This  is  because  the 
pathogenic  properties  of  an  organism  reside  in  the 
genome.  For  example,  acquisition  of  the  phage -borne 
pathogenicity  island  sit  gene  by  nonpathogenic  E.  coli 
could  lead  to  the  emergence  of  hemorrhagic  colitis  caused 
by  a  non-0157:H7  strain.  Of  equal  concern  is  the 
possibility  that  the  toxin  gene,  along  with  other  essential 
virulence  determinants,  could  be  introduced  into  an  E. 
coli  or  Shigella  strain  that  does  not  match  the  known 
pathogenic  serotypes  or  even  into  a  bacterial  strain  that 
causes  respiratory  infections,  thus  delivering  the  toxin 
through  a  potentially  more  lethal  route.  For  this  reason, 
tools  to  rapidly  identify  specific  genes  that  are  essential 
for  production  of  disease  in  the  host  are  being  developed. 
By  screening  for  a  large  number  of  viridence 
determinants,  many  of  which  are  found  within 
pathogenicity  islands,  patterns  of  genes  present  in  a 
potential  pathogen  can  be  identified.  The  set  of  virulence 
determinants  present  indicates  the  type  of  disease  that 
could  be  produced.  The  presence  of  genes  required  for 
virulence  is  a  more  reliable  indicator  of  pathogenicity 
than  antigenic  or  metabolic  characteristics  that  could  vary 
independently  of  disease  potential.  This  is  because  the 
antigenic  properties  of  many  pathogens  are  very  similar  to 
that  of  non-pathogenic  organisms  (e.g.  bacillus  anthracis 
and  bacillus  cereus)  and  so  false  positives  are 
confounding. 

With  the  emergence  of  annual  outbreaks  of  specific 
pathogenic  viruses  (e.g.,  SARS,  West  Nile  Virus),  along 
with  the  danger  of  viral  bioattack,  it  has  become 
increasingly  important  to  rapidly  detect  and  identify  these 
and  other  pathogenic  viruses.  With  respect  to  the  SARS- 
CoV,  it  has  an  enveloped  positive-stranded  RNA  virus 
with  a  single  stranded  genome  between  27  kb  and  31.5 
kb.  The  following  are  some  of  the  structural  genes  found 
in  this  virus:  S,  E,  M  and  N.  The  S  protein  is  a  major 
structural  protein  that  mediates  the  fusion  activity  that  is 
observed  in  coronavirus-infected  cells  and  it  plays  an 
important  role  in  virus  attachment  to  the  coronavirus- 
specific  receptor.  The  M  and  E  proteins  are  essential  for 
efficient  production  of  virus  envelope  and  release 
(Vennema  et  al.,  1996).  Additionally,  the  M  protein  plays 
a  critical  role  in  incorporation  of  S  and  N  proteins  into  virus 
particles  (de  Flaan  et  al.,  1999;  Narayanan  et  al.,  2000; 
Narayanan  et  al.,  2001,  Narayanan  et  al.,  2003).  With 
respect  to  RVFV,  it  too  has  several  known  genes,  two  of 
them  being  NSs  and  L  (Billecocq  et  al.,  2004;  Lopez  et  al., 
1995).  All  of  these  relatively  conserved,  structurally 
critical  genes  found  in  SARS-CoV  and  RVFV  may  serve 
as  specific  identifiers  to  the  presence  of  a  specific  virus. 

This  report  describes  the  preparation  of  synthetic 
DNA  probes  designed  to  detect  amplicons  of  enteric 
pathogenic  bacteria,  the  SARS-CoV,  and  the  RVFV  and 
their  use  with  a  multiplexed  platform  that  has  the 


capability  to  assay  several  hundred  samples  per  hour.  The 
challenge  of  the  multiplexed  system  is  to  develop 
appropriate  probes  that  will  read  multiple  amplicons  in  a 
mixture  of  materials  with  minimal  interference  and  cross 
reactions  between  the  selected  probes  or  amplicons.  An 
associated  challenge  is  to  develop  probes  that  operate  at 
similar  melting  temperatures  so  that  a  single  protocol  will 
be  appropriate  for  assay  conditions. 

C.  MATERIALS  &  METHODS 

1.  Preparation  of  Amplicons  and  Genomic  Probes  for 
Enteropathogenic  Bacteria,  the  N  and  M  Genes  of  the 
SARS-CoV  and  the  NSs  and  L  genes  of  the  RVFV 

Biotinylated  amplicons  for  pathogenicity  islands  of 
Shigella  dysenteriae  and  S.  flexneri  ( shuA ,  stx,  ipaA,  iuc ), 
Vibrio  cholerae  ( hutA ,  ctx),  and  E.  coli  0 1 57:H7  ( shuA , 
eaejct )  were  prepared  (between  three  and  four  probes  for 
each  amplicon  were  made).  Six  unique  oligonucleotide 
probes  were  designed  and  prepared  for  each  of  the  two 
amplicon  sequences  (N  &  M)  of  the  SARS-CoV,  while 
each  of  these  probes  were  covalently  coupled  to  a  unique 
bead.  Finally,  after  identifying  specific  genomic  regions 
within  RVFV  that  code  for  the  NSs  and  L  proteins,  six 
unique  biotinylated  oligonucleotide  amplicons  were 
prepared.  It  should  be  noted  that  all  probes  made  for  this 
work  were  generated  from  genomic  sequences  that  are 
found  in  the  public  domain  at: 
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi7dlFdSlucle 

otide&itool=toolbar. 

2.  Technology  of  Multiplexed  Assays  using  the 
Luminex  xMAP  System. 

The  Luminex  xMAP  system  provides  a  rapid  and 
cost  effective  method  to  simultaneously  detect  the 
presence  of  specific  DNA  or  RNA  sequences  of  many 
different  bacterial  and  viral  pathogens.  The  xMAP  system 
utilizes  polystyrene  microspheres  internally  dyed  with 
two  spectrally  distinct  fluorochromes.  Using  precise  ratios 
of  these  fluorochromes,  a  suspension  array  is  created 
consisting  of  100  spectrally  distinct  microsphere  sets. 
Each  microsphere  set  can  possess  a  different  reactant  on 
its  surface,  and  up  to  100  different  analytes/sequences  can 
be  detected  simultaneously  in  a  single  reaction  vessel.  A 
third  tluorochrome  coupled  to  a  reporter  molecule 
quantifies  the  biomolecular  interaction  that  has  occurred 
at  the  microsphere  surface.  The  multiplexing  capability  of 
the  xMAP  system  provides  an  assay  detection  platform 
that  does  not  require  users  to  prejudicially  screen  for 
specific  organisms,  but  allows  a  more  thorough  screen  for 
the  presence  of  biowarfare,  food-borne,  and 
environmental  pathogens. 


2 


3.  Development  of  Luminex  xMAP  Screening  of 
Amplicons  of  Enteropathogenic  Bacteria,  SARS-CoV 
and  RVFV 

Each  oligonucleotide  probe  used  in  this  study  was 
covalently  coupled  to  a  unique  bead.  Amplicons  were 
screened  under  hybridization  conditions  (48°C)  optimized 
to  permit  hybridization  of  all  the  probes  in  the  set.  The 
sensitivity  of  the  Luminex  xMAP  system  has  been 
examined  in  two  ways:  1)  by  spiking  gloves  with  Shigella 
dysenteriae  and  2)  by  calculating  the  theoretical  number 
of  organisms  that  can  be  detected  after  the  Polymerase 
Chain  Reaction  (PCR).  Individual  gloves  were 
contaminated  with  1.5  x  10s,  1.5  x  106,  or  1.5  x  102cfu  of 
Shigella  dysenteriae.  Gels  containing  the  products  of  the 
PCR  reactions  from  the  gloves  were  run  and  clear  bands 
were  detected  from  the  150  cfu  sample.  The  Luminex 
system  has  also  been  shown  capable  of  detecting  1 
attamole  of  DNA  per  sample  (106  molecules  of  DNA). 
After  35  PCR  cycles,  DNA  template  from  very  few 
bacteria  (less  than  100)  will  yield  a  clearly  detectable 
signal  at  the  attamole  level. 


D. RESULTS 

1.  Bacterial  Pathogenicity  Islands  and  Associated 
Toxins 

A  number  of  genes  in  enteric  bacteria  are  found  in 
pathogens  but  are  generally  absent  from  the  non- 
pathogenic  strains.  These  include  iron  transport  genes 
such  as  the  heme  transport  genes  (shu  or  elm)  of  Shigella 
and  pathogenic  E.  coli,  as  well  as  the  aerobactin  ( iuc)  iron 
transport  system  (Mills  and  Payne,  1995,  1997;  Purdy  and 
Payne,  2002;  Torres  and  Payne,  1997;  Vokes  et  al.,  1999; 
Wyckoff  et  ah,  1998).  The  distribution  of  these  genes 
among  a  variety  of  pathogenic  and  non-pathogenic  enteric 
bacteria  has  been  characterized.  These  data,  along  with  an 
analysis  of  the  genome  data  for  other  virulence  factors, 
have  revealed  patterns  of  genes  that  are  associated  with 
specific  pathogens.  Many  of  these  virulence  genes  map 
within  pathogenicity  islands,  bacteriophages,  or  plasmids. 
Table  1  shows  a  sample  of  these  data. 


Each  pathotype  has  a  distinctive  pattern  of  virulence 
genes  or  accessory  genes  present  in  the  genome.  By 
identifying  the  pattern,  the  species  or  strain  of  enteric 
pathogen  can  be  accurately  determined.  Because  some  of 
these  genes  are  responsible  for  the  symptoms  and 
pathology  of  the  diseases,  detection  of  the  gene  pattern 
allows  prediction  of  the  type  of  disease  that  will  result 
from  infection  with  the  strains.  For  example, 
identification  of  a  strain  that  carries  genes  for  the  shiga 
toxin  (.s' tv),  attaching  and  effacing  lesions  (eae)  and  heme 
transport  (shu)  indicates  that  it  has  the  potential  to  cause 
hemorrhagic  colitis  ( E .  coli  0157:H7,  for  example). 
Because  the  Radix  BioSolutions  technology  allows  us  to 
screen  for  many  different  DNA  sequences 
simultaneously,  probes  for  highly  conserved  regions  in 
each  gene  were  used  to  detect  the  presence  of  the  genes, 
as  well  as  probes  to  variable  regions  to  identify  variants 
(e.g.,  stxl  and  stx2,  or  S.  enterica  iro  vs.  S.  dysenteriae 
iro). 

Many  of  these  genes  map  within  pathogenicity 
islands.  The  virulence  genes  within  the  islands,  such  as 
the  iuc  genes,  may  be  highly  conserved.  Other  genes  or 
sequences  within  the  island  may  be  unique  to  that  strain. 
The  iuc  genes  in  S.  flexneri  and  S.  boydii  are  essentially 
identical  but  other  island  sequences  adjacent  to  the  iuc 
genes  are  distinct  in  each  species.  The  location  of  the 
island  is  different  in  these  species.  The  iuc  island  maps 
downstream  of  selC  in  S.  flexneri  but  is  downstream  of 
pheU  in  S.  boydii.  Probes  that  recognize  the  non -iuc 
island  sequences  and  the  junctions  between  the  island  and 
the  backbone  genome  can  easily  distinguish  between 
these  two  species. 

2.  Probe  Specificity  for  Viral  Pathogenic  Factor 
Amplicons 

Synthetic  DNA  probes  were  designed  to  detect 
potential  amplicons  of  specific  genomic  regions  found  in 
pathogenic  viruses.  In  Figure  la,  a  negative  control  (-) 
shows  that  no  amplification  of  the  PCR  products  was 
detected  in  the  absence  of  probe  sets.  However,  when 
probes  were  included  in  the  PCR  mixture  (+),  a 
homogeneous  band  at  300  base  pairs  (bp)  was  present. 


Table  1.  Virulence  Genes 


Strain 

iuc 

shu 

ipaB 

stx 

eae 

sit 

iro 

ybt 

E.  coli  K12 

- 

- 

- 

- 

- 

- 

- 

- 

Shigella  flexneri 

+ 

- 

+ 

- 

- 

+ 

- 

- 

Shigella  dysenteriae 

- 

+ 

+ 

+ 

- 

+ 

+ 

- 

Shigella  sonnei 

- 

- 

+ 

- 

- 

+ 

- 

- 

E.  coli  01 57  :H7 

- 

+ 

- 

+ 

+ 

- 

- 

+ 

Enteropathogenic  E.  coli 

- 

+ 

- 

- 

+ 

- 

- 

- 

Uropathogenic  E.  coli 

+ 

+ 

- 

- 

- 

+ 

+ 

+ 

Salmonella  enterica 

rare 

- 

- 

- 

- 

+ 

+ 

- 

3 


Therefore,  there  are  specific  amplicons  for  the  N  and  M 
genes  of  SARS-CoV.  Likewise,  a  similar  result  was 
obtained  for  the  NSs  and  L  genes  for  RVFV  (Figure  lb.). 
This  gel  reveals  the  homogeneity  of  the  products, 
therefore  concluding  there  are  amplicons  for  NSs  and  L 
genes  of  RVFV. 


UrMF-MR  UrNF-NR 

MW  +  -  +  - 


Template:  SARS-CoV 
(Urbani)  Plasmid 


300  bp 


Fig.  la,  SARS-CoV  M  and  N  PCR  amplification  products. 


Fig.  lb,  RVFV  NSs  and  L  PCR  amplification  products. 


3.  Luminex  xMAP  Screening  of  Amplicons  of 
En teropa tli ogen i c  Bacteria,  SARS-CoV  and  RVFV. 

The  following  sections  describe  the  actual  application 
of  the  specific  probes  and  screening  of  amplicons  using 
the  Luminex  xMAP  system.  These  results  form  the  basis 
for  the  use  of  these  probes  to  detect  amplicons  of  threat 
agents  using  a  platform  that  will  assay  several  hundred 
samples  per  hour. 


a.  Shigella  dysenteriae,  S .flexneri,  Vibrio  cholerae, 
and  E.  coli  0157:H7.  The  designed  probe  sets  can 
specifically,  and  with  no  apparent  false  negatives/ 
positives,  detect  eight  enteric  pathogenicity  islands 
(Figure  2).  Furthermore,  Figure  2  illustrates  the  high 
degree  of  specificity  and  ability  of  probes  to  clearly 
recognize  amplicons  from  specific  pathogenicity  islands 
( hutA ,  eae,  ipaA,  stx,  ctx,  shuA,  jet,  and  iuc).  Each  bar 
represents  a  unique  bead  with  a  unique  oligonucleotide 
probe  (22  nucleotides  long).  For  seven  of  the  eight 
amplicon  sequences,  four  probe  bead  sets  were  used,  and 
for  hut  A  three  probe  bead  sets  were  used.  When  the  total 
of  3 1  probe  bound  bead  sets  were  used  to  screen  for  the  8 


individual  PCR  amplicons,  all  specific  probes  were 
recognized  as  positives. 

b.  SARS-CoV  and  RVFV.  Six  unique 
oligonucleotide  probes  were  designed  for  each  of  the  two 
amplicon  sequences  of  SARS-CoV.  Six  unique 
oligonucleotide  probes  were  designed  for  each  of  the  two 
amplicon  sequences  of  RVFV.  Each  of  the  probes  was 
covalently  coupled  to  a  unique  bead.  The  SARS-CoV  M 
amplicon  yielded  three  high  signal  intensities  relative  to 
background  non-binding  probes,  while  the  SARS-CoV  N 
amplicon  yielded  five  highly  specific  signal  intensities 
(Figure  2).  With  all  of  the  RVFV  PCR  products,  all  six 
probes  showed  high  specificity  and  signal  intensity 
(Figure  3). 

c.  False  negatives  and  false  positives.  Multiple 
probes  for  the  same  amplicon  reduce  false  positive  and 
false  negative  detection  events.  This  assertion  is  validated 
by  examination  of  Figure  2.  In  this  figure,  two 
observations  can  be  made:  the  first  is  that  probes  that 
were  not  designed  for  the  amplicons  all  yielded  a  similar 
low  signal.  The  second  is  that  only  the  appropriate  probes 
for  the  amplicons  reacted  strongly.  In  Figure  2,  only  the 
four  probes  for  amplicons  eae,  ipaA,  stx,  ctx,  shuA,  jet, 
and  iuc  reacted  with  the  appropriate  amplicon.  HutA 
reacted  with  three  probes.  Accordingly,  as  seen  in  Figure 
3,  similar  specificities  were  observed.  All  of  these 
observations  exemplify  the  reduced  likelihood  of  false 
negatives  and  of  false  positives;  however,  the  real  test  of 
false  positives  requires  testing  every  known  strain  of 
organism  in  every  known  matrix  and  this  is  impractical. 


E.  SUMMARY 

Probe  sets  have  been  designed  and  developed  to 
specifically  generate  the  PCR  products  from  genes  which 
are  characteristic  of  eight  pathogenicity  islands  in 
enteropathogenic  bacteria.  Probe  sets  for  SARS-CoV  and 
RVFV  genes  associated  with  pathogenicity  have  also 
been  prepared.  Additionally,  all  of  these  pathogenic 
genomes  have  been  subject  to  testing  using  the 
multiplexed  assays  applied  by  the  Luminex  xMAP 
System.  Importantly,  the  PCR  results  provide  a  proof-of- 
concept  for  the  development  of  the  PCR  probe  sets  which 
can  be  used  in  a  multi-PCR  assay  kit  for  the  generation  of 
amplicons  of  interest  in  a  mixture.  The  designed  probe 
sets  can  also  specifically,  and  with  no  false  negatives, 
detect  the  eight  enteric  pathogenicity  islands  and  the 
SARS-CoV  and  RVFV  sequences. 
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Fig.  2,  Hybridization  of  pathogenicity  island  amplicons  to  specific  polynucleotide  probes. 


RVFV  2H501  L  from  cDNA  RVFV  2H501  NSs  (ram  cONA  SARS  M  SARS  N 

Positive  PCR  product 


■  RVFV-L  1  ■  RVFV-L  2  PRVFV-L3  □  RVFV-L  4  ■  RVFV-L  5  ■  RVFV-L  6  ■  RVFV- NSs  1  n  RVFV- NSs  2 

■  RVFV-  NSs  3  ■  RVFV-  NSs  4  p  RVFV-  NSs  S  P  RVFV-  NSs  6  ■  SARS  CoV  M  1  ■  SARS  CoV  M2  ■  SARS  CoV  M  3  ■  SARS  CoV  M  4 

□  SARS  CoV  M  5  P  SARS  CoV  M  6  P  SARS  CoV  N  1  P  SARS  CoV  N  2  P  SARS  CoV  N  3  P  SARS  CoV  N  4  □  SARS  CoV  N  5  P  SARS  CoV  N  6 

Fig.  3,  Hybridization  of  RVFV  amplicons  (NSs  and  L)  and  SARS-CoV 

amplicons  (M  and  N)  to  specific  polynucleotide  probes. 
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